skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Joseph-McCarthy, Diane"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Free, publicly-accessible full text available August 1, 2026
  2. Free, publicly-accessible full text available December 1, 2025
  3. Target identification and evaluation is a critical step in the drug discovery process. Although time-intensive and complex, the challenge becomes even more acute in the realm of infectious disease, where the rapid emergence of new viruses, the swift mutation of existing targets, and partial effectiveness of approved antivirals can lead to outbreaks of significant public health concern. The COVID-19 pandemic, caused by the SARS-CoV-2 virus, serves as a prime example of this, where despite the allocation of substantial resources, Paxlovid is currently the only effective treatment. In that case, significant effort pre-pandemic had been expended to evaluate the biological target for the closely related SARS-CoV. In this work, we utilize the computational hot spot mapping method, FTMove, to rapidly identify and rank binding sites for a set of nine SARS-CoV-2 drug/potential drug targets. FTMove takes into account protein flexibility by mapping binding site hot spots across an ensemble of structures for a given target. To assess the applicability of the FTMove approach to a wide range of drug targets for viral pathogens, we also carry out a comprehensive review of the known SARS-CoV-2 ligandable sites. The approach is able to identify the vast majority of all known sites and a few additional sites, which may in fact be yet to be discovered as ligandable. Furthermore, a UMAP analysis of the FTMove features for each identified binding site is largely able to separate predicted sites with experimentally known binders from those without known binders. These results demonstrate the utility of FTMove to rapidly identify actionable sites across a range of targets for a given indication. As such, the approach is expected to be particularly useful for assessing target binding sites for any emerging pathogen, as well as for indications in other disease areas, and providing actionable starting points for structure-based drug design efforts. 
    more » « less
  4. The goal of this paper is predicting the conformational distributions of ligand binding sites using the AlphaFold2 (AF2) protein structure prediction program with stochastic subsampling of the multiple sequence alignment (MSA). We explored the opening of cryptic ligand binding sites in 16 proteins, where the closed and open conformations define the expected extreme points of the conformational variation. Due to the many structures of these proteins in the Protein Data Bank (PDB), we were able to study whether the distribution of X-ray structures affects the distribution of AF2 models. We have found that AF2 generates both a cluster of open and a cluster of closed models for proteins that have comparable numbers of open and closed structures in the PDB and not too many other conformations. This was observed even with default MSA parameters, thus without further subsampling. In contrast, with the exception of a single protein, AF2 did not yield multiple clusters of conformations for proteins that had imbalanced numbers of open and closed structures in the PDB, or had substantial numbers of other structures. Subsampling improved the results only for a single protein, but very shallow MSA led to incorrect structures. The ability of generating both open and closed conformations for six out of the 16 proteins agrees with the success rates of similar studies reported in the literature. However, we showed that this partial success is due to AF2 “remembering” the conformational distributions in the PDB and that the approach fails to predict rarely seen conformations. 
    more » « less
    Free, publicly-accessible full text available November 26, 2025
  5. Introduction:A fundamental challenge in computational vaccinology is that most B-cell epitopes are conformational and therefore hard to predict from sequence alone. Another significant challenge is that a great deal of the amino acid sequence of a viral surface protein might not in fact be antigenic. Thus, identifying the regions of a protein that are most promising for vaccine design based on the degree of surface exposure may not lead to a clinically relevant immune response. Methods:Linear peptides selected by phage display experiments that have high affinity to the monoclonal antibody of interest (“mimotopes”) usually have similar physicochemical properties to the antigen epitope corresponding to that antibody. The sequences of these linear peptides can be used to find possible epitopes on the surface of the antigen structure or a homology model of the antigen in the absence of an antigen-antibody complex structure. Results and Discussion:Herein we describe two novel methods for mapping mimotopes to epitopes. The first is a novel algorithm named MimoTree that allows for gaps in the mimotopes and epitopes on the antigen. More specifically, a mimotope may have a gap that does not match to the epitope to allow it to adopt a conformation relevant for binding to an antibody, and residues may similarly be discontinuous in conformational epitopes. MimoTree is a fully automated epitope detection algorithm suitable for the identification of conformational as well as linear epitopes. The second is an ensemble approach, which combines the prediction results from MimoTree and two existing methods. 
    more » « less
  6. Krishnan, Viswanathan V. (Ed.)
    At the outset of an emergent viral respiratory pandemic, sequence data is among the first molecular information available. As viral attachment machinery is a key target for therapeutic and prophylactic interventions, rapid identification of viral “spike” proteins from sequence can significantly accelerate the development of medical countermeasures. For six families of respiratory viruses, covering the vast majority of airborne and droplet-transmitted diseases, host cell entry is mediated by the binding of viral surface glycoproteins that interact with a host cell receptor. In this report it is shown that sequence data for an unknown virus belonging to one of the six families above provides sufficient information to identify the protein(s) responsible for viral attachment. Random forest models that take as input a set of respiratory viral sequences can classify the protein as “spike” vs. non-spike based on predicted secondary structure elements alone (with 97.3% correctly classified) or in combination with N-glycosylation related features (with 97.0% correctly classified). Models were validated through 10-fold cross-validation, bootstrapping on a class-balanced set, and an out-of-sample extra-familial validation set. Surprisingly, we showed that secondary structural elements and N-glycosylation features were sufficient for model generation. The ability to rapidly identify viral attachment machinery directly from sequence data holds the potential to accelerate the design of medical countermeasures for future pandemics. Furthermore, this approach may be extendable for the identification of other potential viral targets and for viral sequence annotation in general in the future. 
    more » « less
  7. The rapid emergence of immune-evading viral variants of SARS-CoV-2 calls into question the practicality of a vaccine-only public-health strategy for managing the ongoing COVID-19 pandemic. It has been suggested that widespread vaccination is necessary to prevent the emergence of future immune-evading mutants. Here, we examined that proposition using stochastic computational models of viral transmission and mutation. Specifically, we looked at the likelihood of emergence of immune escape variants requiring multiple mutations and the impact of vaccination on this process. Our results suggest that the transmission rate of intermediate SARS-CoV-2 mutants will impact the rate at which novel immune-evading variants appear. While vaccination can lower the rate at which new variants appear, other interventions that reduce transmission can also have the same effect. Crucially, relying solely on widespread and repeated vaccination (vaccinating the entire population multiple times a year) is not sufficient to prevent the emergence of novel immune-evading strains, if transmission rates remain high within the population. Thus, vaccines alone are incapable of slowing the pace of evolution of immune evasion, and vaccinal protection against severe and fatal outcomes for COVID-19 patients is therefore not assured. 
    more » « less